A Dual Channel Coupled Decoder for Fillers and Feedback
Abstract
This study presents a dual channel decoder capable of modeling cross-speaker dependencies for the segmentation and classification of fillers and feedback in conversational speech from the DEAL corpus. For the same number of Gaussians per state, we show improvements in average F-score from the successive addition of: 1) an increase of the frame interval from 10 ms to 50 ms; 2) Joint Maximum Cross-Correlation (JMXC) features in a single channel decoder; 3) a joint transition matrix that captures dependencies symmetrically across the two channels; and 4) coupled acoustic model retraining, applied symmetrically across the two channels. The final step gives a relative improvement of over 100% for fillers and feedback compared to our previously published results. The F-scores are high enough to make it possible to use the decoder as both a voice activity detector and an illocutionary act decoder for semi-automatic annotation.
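As an illustration of what a joint transition matrix over two speaker channels can look like, the sketch below estimates one by counting frame-to-frame transitions of paired labels. It is a minimal example under stated assumptions, not the decoder described in the abstract: the function name `joint_transition_matrix`, the three-label inventory, the toy label sequences, and the additive smoothing are all hypothetical choices made for the illustration.

```python
import numpy as np

def joint_transition_matrix(chan_a, chan_b, n_states, smoothing=1.0):
    """Estimate P(next joint state | current joint state) by counting.

    chan_a, chan_b: frame-aligned integer label sequences, one per speaker.
    A joint state is the pair (a, b), encoded as a * n_states + b.
    smoothing: additive count per cell (assumed > 0 so no row is empty).
    """
    a = np.asarray(chan_a)
    b = np.asarray(chan_b)
    if a.shape != b.shape:
        raise ValueError("both channels need the same number of frames")
    joint = a * n_states + b                      # one index per frame
    n_joint = n_states * n_states
    counts = np.full((n_joint, n_joint), smoothing, dtype=float)
    # Accumulate one count per observed transition, repeats included.
    np.add.at(counts, (joint[:-1], joint[1:]), 1.0)
    return counts / counts.sum(axis=1, keepdims=True)  # row-normalize

# Hypothetical labels: 0 = silence, 1 = speech, 2 = feedback.
speaker_a = [1, 1, 1, 0, 0, 0, 1, 1]
speaker_b = [0, 0, 0, 2, 2, 0, 0, 0]
T = joint_transition_matrix(speaker_a, speaker_b, n_states=3)
```

Because each row is indexed by the labels of both speakers at once, a row such as "A speaking, B silent" can assign its own probability to "A silent, B giving feedback", which two independent per-channel matrices cannot express.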
Similar resources
Modeling conversational interaction using coupled Markov chains
This paper presents a series of experiments on automatic transcription and classification of fillers and feedback in conversational speech corpora. A feature combination of PCA-projected normalized F0 Constant-Q Cepstra and MFCCs has been shown to be effective for standard Hidden Markov Models (HMMs). We demonstrate how to model both speaker channels with coupled HMMs and show the expected improvements. ...
An adaptive error-resilient video encoder
When designing an encoder for a real-time video application over a wireless channel, we must take into consideration the unpredictable fluctuation of the quality of the channel and its impact on the transmitted video data. This uncertainty motivates the development of an adaptive video encoding mechanism that can compensate for the infidelity caused either by data loss and/or by the post-proces...
Scalable layered space-time codes for wireless communications: performance analysis and design criteria
Dual antenna-array systems provide very high capacity compared to single antenna systems in a Rayleigh fading environment. If the transmitter does not have channel state information, to utilize this high capacity, space-time codes must be employed. The diagonally layered space-time (DLST) architecture is a structure that is capable of providing a high data rate for a low decoding complexity. ...
Bounds for Multiple-Access Relay Channels with Feedback via Two-way Relay Channel
In this study, we introduce a new two-way relay channel and obtain an inner bound and an outer bound for the discrete and memoryless multiple access relay channels with receiver-source feedback via two-way relay channel in which end nodes exchange signals by a relay node. And we extend these results to the Gaussian case. By numerical computing, we show that our inner bound is the same with o...
Low-Complexity LDPC-Coded USTM Noncoherent MIMO Receivers
This paper proposes a scheme of combining low-density parity check (LDPC) code with unitary space time modulation (USTM) for noncoherent multiple-input-multiple-output (MIMO) transmitter and receiver over Rayleigh block fading and additive white Gaussian noise (AWGN) channel. The main aim is to design the low complexity coded noncoherent MIMO receiver which is completely dependent on the struct...